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Abstract 

In this paper we introduce a new notion of convergence of sparse graphs which we call 
Large Deviations or LD-convergence and which is based on the theory of large deviations. 
The notion is introduced by "decorating" the nodes of the graph with random uniform 
i.i.d. weights and constructing random measures on [0, 1] and [0, l] 2 based on the decora- 
tion of nodes and edges. A graph sequence is defined to be converging if the corresponding 
sequence of random measures satisfies the Large Deviations Principle with respect to the 
topology of weak convergence on bounded measures on [0, l] d ,d = 1,2. We then estab- 
lish that LD-convergence implies several previous notions of convergence, namely so-called 
right-convergence, left-convergence, and partition-convergence. The corresponding large 
deviation rate function can be interpreted as the limit object of the sparse graph sequence. 
In particular, we can express the limiting free energies in terms of this limit object. 

Finally, we establish several previously unknown relationships between the formerly 
defined notions of convergence. In particular, we show that partition-convergence does 
not imply left or right- convergence, and that right-convergence does not imply partition- 
convergence. 
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1 Introduction 



The theory of convergent graph sequences for dense graphs is by now a well-developed subject, 
with many a priori distinct notions of convergence proved to be equivalent. For sparse graph 
sequences (which in this paper we take to be sequences with bounded maximal degree), much less 
is known. While several of the notions defined in the context of dense graphs were generalized 
to the sparse setting, and some other interesting notions were introduced in the literature, the 
equivalence between the different notions was either unknown or known not to hold. In some 
cases, it was not even known whether one is stronger than the other or not. In this paper, we 
introduce a new natural notion of convergence for sparse graphs, which we call Large Deviations 
Convergence, and relate it to other notions of convergence. We hope that this new notion will 
also ultimately allow us to relate many of the previous notions to each other. 

Let G n be a sequence graphs such that the number of vertices in G n goes to infinity. The 
question we address here is the notion of convergence and limit of such a sequence. 

For dense graphs, i.e., graphs for which the average degree grows like the number of vertices, 
this question is by now well understood |BCL+06| . |BCL+08| . |BCL+ 12j . These works introduced 
and showed the equivalence of various notions of convergence: left-convergence, defined in terms 
of homomorphisms from a small graph F into G n ; right-convergence, defined in terms of homo- 
morphisms from G n into a weighted graph EI with strictly positive edge weights; convergence in 
terms of the so-called cut metric; and several other notions, including convergence of the set of 
quotients of the graphs G n , a notion which will play a role in this paper as well. The existence 
and properties of the limit object were established in |LS06j . and its uniqueness was proved 
in |BCL10j . Some lovely follow up work on the dense case gave alternative proofs in terms of 
exchangeable random variables [DJ07] , and provided applications using large deviations |CVllj . 

For sparse graph sequences much less is known. In |BCKL13] . left- convergence was defined 
by requiring that the limit 



exists for every connected, simple graph F, where hom(F, G) is used to denote the number of 
homomorphisms from F into G. Here and in the rest of this paper V(G) denotes the vertex set 
of G. Also E(G) denotes the edge set of G and for u, v 6 V(G) we write u ~ v if (u, v) G E(G). 
It is easy to see that for sequences of graphs with bounded degrees, this notion is equivalent to 
an earlier notion of convergence, the notion of local convergence introduced by Benjamini and 
Schramm pSOlJ. 

To define right-convergence, one again considers homomorphisms, but now from G n into a 
small "target" weighted graph EL Each homomorphism is equipped with the weight induced by 
EL Since the total weight of all homomorphisms denoted by hom(G n ,EI) is at most exponential 
in |V(G n )|, it is natural to consider the sequence 



which in the terminology of statistical physics is the sequence of free energies. In [B CKL13] 
a sequence G n was defined to be right-convergent with respect to a given graph EI if the free 
energies f(G n , EI) converge as n — > oo. It was shown that a sequence G n that is right-convergent 
on all simple graphs EI is also left-convergent. But the converse was shown not to be true. The 






(2) 
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counterexample is simple: Let C n be the cycle on n nodes, and let EI be a bipartite graph. Then 
hom(C n ,EI) = for odd n, while hom(C n ,EI) > 1 for even n. 

This example, however, relies crucially on the fact that EI encodes so-called hard-core con- 
straints, meaning that some of the maps : V(G n ) — > V(M) do not contribute to hom(G n ,H). 
As we discuss in Section 4, such hard-core constraints are not very natural in many respects, 
one of them being that the existence or value of the limit of the sequence ^ can change if we 
remove a sub-linear number of edges from G n (in the case of C n , just removing a single edge 
makes the sequence convergent, as can be seen by a simple sub-additivity argument). Hard-core 
constraints are also not very natural from a point of view of applications in physics, where hard- 
core constraints usually represent an idealized limit like the zero-temperature limit, which is a 
mathematical idealization of something that in reality can never be realized. 

We therefore propose to define right-convergence by restricting ourselves to so-called soft- 
core graphs, i.e., graphs EI with strictly positive edge weights. In other words, we define G n 
to be right-convergent iff the free energy defined in ^ has a limit for every soft-core graph 
EI. This raises the question of how to relate the existence of a limit for models with hard-core 



constraints to those on soft-core graphs, a question we will address in Section |4.2| below, but 
we do not make this part of the definition of right-convergence. Given our relaxed definition 
of the right-convergence, the question then remains whether left-convergence possibly implies 
right-convergence. The answer turns out to be still negative, and involves an example already 



considered in |BCKL13"] . see Section 5.1 below. 

Next let us turn to the notion of partition-convergence of sparse graphs, a notion which 
was introduced by Bollobas and Riordan |BR11] , motivated by a similar notion for dense graphs 
from |BCL+08j . |BCL+12j . where it was called convergence of quotients. We start by introducing 
quotients for sparse graphs. Fix a graph G and let a — (Vx, . . . , V&) be a partition of V(G) into 
disjoint subsets (where some of the V^'s may be empty). We think of this partition as a (non- 
proper) coloring a : V{G) —> [k] with Vi = cr _1 (i),l < i < k. The partition a then induces a 
weighted graph F = G/a on [k] = {1, . . . , k} (called a fc-quotient) by setting 

\Vi\ 1 
Xi(a) = — and X^a) = — \{u 6 V h v <E Vj : u ~ v}\ . (3) 

Here x(a) = (xi(a), 1 < i < k) and X(a) = (Xij(a), 1 < i, j < k) are the set of node and edge 
weights of F, respectively. Then for a given k, the set of all possible fc-quotients (x(a), X(a)) 
is a discrete subset Sk(G) C IR^" 1 " 1 ^. Henceforth K(R + ) denotes the set of all (non-negative) 
reals, and Z(Z + ) denotes the set of all (non-negative) integers. Motivated by the corresponding 
notion from dense graphs, one might want to study whether the sequence of fc-quotients Sk(G n ) 
converges to a limiting set Sk C R^ 4 " 1 ^ with respect to some appropriate metric on the subsets of 
jj^(fc+i)fc rjnj^-g j eac j g £ notion of partition-convergence, a notion introduced in [BRllj . Since 
it is not immediate whether this notion of convergence implies, for example, left-convergence 
(in fact, one of the results in this paper is that it does not), Bollobas and Riordan |BR11] 
also introduced a stronger notion of colored-neighborhood-convergence. This notion was further 
studied by Hatami, Lovasz and Szegedi [HLS12J. An implication of the results of these and 
earlier papers (details below) is that colored-neighborhood-convergence is strictly stronger than 
the notion of both partition- and left-convergence, but it is not known whether it implies right- 
convergence. 
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We now discuss the main contribution of this paper: the definition of convergent sparse graph 
sequences using the formalism of large deviations theory. We define a graph sequence to be Large 
Deviations (LD)-convergent if for every k, the weighted factor graphs F = G/a defined above 
viewed as a vector (xj(F), 1 < i < k) and matrix (Xy(F), 1 < i, j < k), satisfy the large deviation 
principle in R fc and M. kxk , when the fc-partition a is chosen uniformly at random^ Intuitively, 
the large deviations rate associated with a given graph F provides the limiting exponent for 
the number colorings a such that the corresponding factor graphs G/a is approximately F. It 
turns out that the large deviations rates provide enough information to "read off" the limiting 
partition sets <S&. Those are obtained as partitions with finite large deviations rate. Similarly, 
one can "read off" the limits of free energies (|2]). As a consequence, the LD-convergence implies 
both the partition and right-convergence. 

The deficiency of the definition above is that it requires that the large deviations principle 
holds for a infinite collection of probability measures associated with the choice of k. It turns out 
that there is a more elegant unifying way to introduce LD-convergence by defining just one large 
deviations rate, but on the space of random measures rather than the space of random weighted 
graphs G n /a. This is done as follows. Given a sequence of sparse graphs G n , suppose the 
nodes of each graph are equipped with values (cr(u), u e V(G)) chosen independently uniformly 
at random from [0, 1]. We may think of these values as real valued "colors". From these values 
we construct a one-dimensional measure p n = J2 u ev(G) l^(^ri)| _1 ^(c r (^)) on [0,1] and a two- 
dimensional measure /i n = ^2 UV . ur ^ v |X / (G n )| _1 5(o"('u), u(v)) on [0, l] 2 . Here 5(x) is a measure 
with unit mass at x and zero elsewhere. As such we obtain a sequence of measures (p n ,Pn)- We 
define the graph sequence to be LD-convergent if this sequence of random measures satisfy the 
Large Deviations Principle on the space of measures on [0, l] d , d = 1, 2 equipped with the weak 
convergence topology. One of our first result is establishing the equivalence of two definitions of 
the LD-convergence. To distinguish the two, the former mode of convergence is referred to as 
fc-LD-convergence, and the latter simply as LD-convergence. 

Our most important result concerning the new definition of graph convergence is that LD- 
convergence implies right (and therefore left) as well as partition-convergence. We conjecture 
that it also implies colored-neighborhood-convergence but we do not have a proof at this time. 
Finally, we conjecture that LD-convergence holds for a sequence of random D-regular graphs with 
high probability (w.h.p.), but at the present stage we are very far from proving this conjecture 
and we discuss important implications to the theory of spin glasses should this convergence be 
established. 

Let us finally return to a question already alluded to when we defined right- convergence. 
From several points of view, it seems natural to consider two sequence G„ and G n on the same 
vertex set V n = V(G n ) = V(G n ) to be equivalent if the edge sets E(G n ) and E(G n ) differ 
on a set of o(|l^|) edges. One natural condition one might require of all definitions of con- 
vergence is the condition that convergence of G n implies that of G n and vice versa, whenever 
G n and G n are equivalent. We show that this holds for all notions of convergence consid- 
ered in the paper, namely left-convergence, our modified version of right-convergence, partition- 
convergence, colored-neighborhood-convergence, and LD-convergence. We also show that for all 
right-convergent sequences G n , one can find an equivalent sequence G„ such that on G n , the lim- 
iting free energy /(H) = lim^oo f(G n , H) exists even when EI contains hard-core constraints, see 

1 Some necessary background on Large Deviations Principle and weak convergence of measures will be provided 
in Section |3l 
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Theorem [3] below. Our alternative definition of right-convergence can therefore be reexpressed 
by the condition that there exists a sequence G n equivalent to G n such that the free energies 
/(G n ,EI) converge for all weighted graphs HI. 

The organization of this paper is as follows. In the next section we introduce notations and 
definitions and provide the necessary background on the large deviations theory. In Section [3] 
we introduce LD-convergence, establish some basic properties and consider two examples of LD- 
convergent graph sequences. Other notions of convergence are discussed in Section |4j In the 
same section we prove Theorem [3] relating free energy limits with respect to hard-core and soft 
models, discussed above. The relationship between different notions of convergence is considered 
in Section [5j The main result of this section is Theorem [4j which establishes relationships between 
different notions of convergence. Figures [3] and [4] are provided to illustrate the relations between 
the notions of convergence, both those known earlier and those established in this paper. In the 
last Section [6] we compare the expressions for the limiting free energy to those in the dense case 
(pointing out that the large-deviations rate function plays a role quite similar to the role the 
limiting graphon played in the dense case) and discuss some open questions. 

2 Notations and definitions 

For the convenience of the reader, we repeat some of the notations already used in the intro- 
duction. We consider in this paper finite simple undirected graphs G with node and edge sets 
denoted by V(G) and E(G) respectively. We write u ~ v for nodes u and v when (u, v) G E{G). 
For every node u G V(G), M{u) denotes the set of neighbors of u, namely all v G V(G) such that 
u ~ v. Sometimes we will write in order to emphasize the underlying graph. The number 

A G (w) of vertices in Af^u) is called the degree of u in G. We use A G to denote the maximum 
degree max u Aq(u). 

A path of length r is a sequence of nodes uo,---,u r such that Ui ~ Ui + \ for all i = 
0, 1, 2, . . . , r — 1. The distance between nodes u and v is the length of the shortest path w , • • • , u r 
with uq = u and u r = v. The distance is assumed to be infinite if u and v belong to different 
connected components of G. Given u and r, let B(u, r) = B G (u, r) be the subgraph of G induced 
by nodes with distance at most r from u. Given a set of nodes A C V and r > 1, B(A, r) is the 
graph induced by the set of nodes v with distance at most r from some node u G A. 

Given two graphs G, EI a map a : V(G) — > V(M) is a graph homomorphism if (u, v) G E(G) 
implies (a(u),a(v)) G E(M) for every u, v G V(G). Namely, a maps edges into edges, a : 
G\ —7- G 2 is called a graph isomorphism if it is a bijection and (u,v) G E(Gi) if and only if 
{<r(u), u(v)) G E(G2), i.e., a is an isomorphism which maps edges into edges, and non-edges into 
non-edges. Two graphs are called isomorphic if there is a graph isomorphism mapping them 
into each other. Two rooted graphs (G\,x) and (G2,|/) are called isomorphic if there is an 
isomorphism a : Gi — > G 2 such that cr(x) = y. 

The edit distance between two graphs G and G with the same number of vertices is defined 

as 

5edit(G,G) =min G ,(|E(G / ) \E(G)\ + \E(G) \ E(G')\j 

where the minimum goes over all graph G' isomorphic to G. 

Given two graphs G, EI, we use Hom(G, EI) to denote the set of all homomorphisms from G 
to EI, and hom(G, EI) = | Hom(G, EI)| to denote the number of homomorphisms from G to EI. 
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If EI is weighted, with node weights given by a vector (ai,i G V(H)) and edge weights given 
by a symmetric matrix (Aij,(i,j) G E(M)) (which we denote by a = a(H) and A = A(M) 
respectively), then hom(G, H) is defined by 

hom(G,H) = II <M»0 E[ A ° 

(u),o(v) • (4) 

<j:V(G)->V(U) ueV(G) (u,v)eE(G) 

Clearly, if a is a vector of ones and A is a matrix consisting of zeros and ones with zeros on 
the diagonal, then the definition reduces to the unweighted case where A denotes the adjacency 
matrix of a simple undirected graph H. 

A graph sequence G n = (V(G n ), E(G n )),n > 1 is defined to be sparse if Ag„ < D for some 
finite D for all n. When dealing with graph sequences we write V n and E n instead of V(G n ) and 
E(G n ), respectively. Two graph sequences G n and G n are defined to be equivalent, in which case 
we write G n ~ G n , if G n has the same number of nodes as G n and 5 e dit(G n , G n ) = o(| V(G n )|) as 
n — > oo. Observe that ~ defines an equivalency relationship on the set of sequences of graphs. 

Next we review some concepts from large deviations theory. Given a metric space S equipped 
with metric d, a sequence of probability measures P n on Borel sets of S is said to satisfy the Large 
Deviations Principle (LDP) with rate 6 n > 0, lim n 6 n = oo, if there exists a lower semi-continuous 
function I : S — > 1R+ U {oo} such that for every (Borel) set A C S 

— inf I(x) < liminf — — — - < limsup — — — - < — inf I(x), (5) 

where A° and A denote the interior and the closure of the set A, respectively. In this case, we 
say that (P n ) obeys the LDP with rate function /. Typically 6 n = n is used in most cases, but in 
our case we will be considering the normalization 8 n = \V(G n )\ corresponding to some sequence 
of graphs G n , and it is convenient not to assume that n is the number of nodes in G n . 

The definition above immediately implies that that the rate function / is uniquely recovered 

as 

I(x) = — lim liminf — - — - - — = — lim limsup — - — — , (6) 

for every x G S, where B(x,e) = {y G S : d(x,y) < e}. Indeed, from lower semi-continuity we 
have 

I(x) = lim inf I(y) = lim inf I(y), (7) 

from which the claim follows. In fact the existence and the equality of double limits in is also 
sufficient for the LDP to hold when S is compact. We provide a proof here for completeness. 

Proposition 1. Given a sequence of probability measures P n on a compact metric space S, and 
given a sequence 9 n > 0, lim n 9 n = oo, suppose the limits in the second and third term in |#|) exist 
and are equal for every x. Let I{x) be defined as the negative of these terms for every x G S. 
Then the LDP holds with rate function I and normalization 8 n . 
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Proof. We first establish that / is lower semi-continuous. Fix x,x m G S such that x m — > x. For 
every e, find tuq such that d(x,x m ) < e/2 for m > m . We have for all m > m , 



implying 



■I{x m ) < limsup^logP n (B(x m ,e/2)) < lim sup — log P n (S(x,e)), 

n (7n n t/n. 



lirnsup(— I(x m )) < lim lim sup — log P n (B(x,e)) = —I{x). 



Therefore I is lower semi-continuous. 

It remains to verify ([5]). Fix an arbitrary closed (and therefore compact) A C S, and let 
5 > 0. Then 

lim sup ^ log P n ( A) < — log P n ( A) + 5 

for an infinite number of values for n. Let Nq be the set of n for which this holds. Further, given 
m G Z+, we can find X\, . . . , XN( m ) G A such that A C \Ji<N{m)B(xi, 1/m). Let i(m, n) < N(m) 
be the index corresponding to a largest value of P n (-B(xj, 1/m)). There exists an i(m) such that 
i(m, n) = i(m) for an infinite number of values n G N Q . Denote the set of n for which this holds 
by N m , and assume without loss of generality that 6~ l log N(m) < 5 for all n G N m . For such n, 
we then have 

Pn(A) < ^ Pn(5(x l; 1/m)) < AT( m )P n ( J B(x i(m) , 1/m)) 

i<N(m) 

and hence 

limsup-^logP n (A) < 25+ ^logP n (5(x i(m) ,l/m)). 

Find any limit point x G A of Xj( m ) as m — > oo. Fix e > 0. Find ni\ such that B(xit mi ), 1/mx) C 
B(x,e), and hence 

lim sup log P n ( A) < 25 + ±- log P n ( B {x, e) ) 

whenever n G N mi . Since iV mi contains infinitely many n G Z + , we conclude that 

1 1 
lim sup — logP n (A) < 25 + lim sup — \og¥ n (B(x,e)). 

n "n n "n 

Since this holds for every 5 and e, we may apply (|6| to obtain 

1 

lim sup — logP n (A) < —I(x) < — inf I(y), 
thus verifying the upper bound in (|5]). 
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For the lower bound, suppose A C S is open. Fix e > and find x G A such that I(x) < 
inf^g^ I(y) + e. Find 5 such that 5) C A. We have 

liminf ^-\ogF n (A) > lim inf log P n (S(rc,(5)) 

n n n n 

> ~I(x) 

> - inf I(y) - e. 

Since e is arbitrary, this proves the lower bound. □ 

We finish this section with some basic notions of weak convergence of measures. Let Ai d 
denote the space of Borel measures on [0, l] d bounded by some constant D > 0. The space At 
is equipped with the topology of weak convergence which can be realized using the so-called 
Prokhorov metric denoted by d defined as follows (see |Bil99] for details). For any two measures 
p, v G At d , their distance d(p, v) is the smallest r such that for every measurable A C [0, l] d 
we have p(A) < v(A T ) + r and v(A) < p{A T ) + r. Here A T is the set of points with distance 
at most r from A in the || • norm on [0, l] d (although the choice of norm on [0, l] d is not 
relevant). It is known that that the metric space Ai d is compact with respect to the topology 
of weak convergence. We will focus in this paper on the product Ai l x At 2 which we equip 
with the following metric (which we also denote by d with some abuse of notation): for every 
two pairs of measures (pj,fij) G M 1 x At 2 , j = 1,2, the distance is d ((pi, pi), (p2, ^2)) — 
max (d(px, p 2 ), d(pi, P2)), where d is the metric on At 1 and Ai 2 . 

3 LD-convergence of sparse graphs 
3.1 Definition and basic properties 

Given a graph G = (V,E) and any mapping a : V — > [0, 1], construct (p(a),p(a)) G At 1 x At 2 
as follows. For every u G V we put mass 1/|V| on a(u) G [0, 1]. This defines \V\ points on [0, 1] 
with the total mass 1. The resulting measure is denoted by p(a) £ At 1 . Similarly, we put mass 
l/\V\ on (a(u),a(v)) G [0, l] 2 and (a(v),a(u)) G [0,1] 2 for every edge (u,v) G E. The resulting 
measure is denoted by p(cr) G Ai 2 . Formally 

where the sum goes over oriented edges, i.e., ordered pairs (u,v) such that u is connected to 
v by an edge in E, and 5 X denotes a unit mass measure on x G M. d . For example, consider a 
graph with 4 nodes 1,2,3,4 and edges (1, 2), (1, 3), (2, 3), (3, 4). Suppose a realization of a is 
er(l) = .4, cr(2) = .8,<r(3) = .15,cr(4) = .5, as shown on Figure [TJ The corresponding p(a) and 
p(a) are illustrated on Figure [2j 

Now given a sparse graph sequence G n = (V n , E n ) with a uniform degree bound max„ A Gn < 
D, let a n : V n — > [0, 1] be chosen independently uniformly at random with respect to the nodes 
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(7(1) = 0.4 (t(3) = 0.15 

Figure 1: A four node graph decorated with real valued colors 
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Figure 2: Measures p, /i associated with a four node graph with four edges 



u G V n . Then we obtain a random element (p(a n ), /i(cr n )) G A'! 1 x A'! 2 . We will use notations 
p n and p n instead of p(a n ) and p{a n ). Observe that p„([0,l]) = 1 and < /i n ([0, l] 2 ) < -D. In 
particular, /i n ([0, l] 2 ) = if and only if G n is an empty graph. We now introduce the notion of 
LD-convergence. In this definition and the Definition [2] below we assume 6 n = \V n \. 

Definition 1. A graph sequence G n is defined to be LD-convergent if the sequence (p(a n ) , p(a n )) 
satisfies LDP in the metric space A4 1 x A4 2 . 

Given a sparse graph sequence G n with degree bounded by D and a positive integer k, let 
°~n '■ Vn — > [k] be chosen uniformly at random. Consider the corresponding random weighted 
/c-quotients F n = G n /a n defined via ^ with random node weights x(a n ) = (xj(<r n ), 1 < % < k) 
and random edge weights X(a n ) = (Xij(a n ), 1 < i,j < k). Note that 'Yli x i(. a n) = 1 and 
Yli j x ij( a n) < D. Assuming without loss of generality that D > 1, it it will be convenient to 
view (x(<7 n ), X(a n )) as a (k + 1) x k matrix with elements bounded by D, namely an element 
in [0, £)]( fe + 1 ) xfc . We consider [0, _D]( fc+1 ) xfc equipped with the L M norm. That is the distance 
between (x,X) and (y, Y) is the maximum of max!<j<jt \xi — yi\ and maxx^jjxfc |Jfy — Yi t j\- 

Definition 2. A graph sequence G n is defined to be k-LD-convergent if for every k the sequence 
(x(a n ),X(a n )) satisfies the LDP in [0, D] (fc+1)xfc with some rate function I k : [0, D] (fc+1)xfc ->■ 
R + U {oo}. 

The intuition behind this definition is as follows. Given (x, X) suppose there exists a k- 
coloring of a graph G n such that the corresponding /c-quotient weighted graph is "approximately" 
equal to (x, X) to within some additive error e. Then in fact there are exponentially in \V n \ many 
/c-colorings of G n which achieve "nearly" the same quotient graph up to say an 2e additive error 
(by arbitrarily recoloring e\V n \/D many nodes). In this case fc-LD-convergence of the graph 
sequence G n means that the exponent of the number of colorings achieving the quotient (x, X) 
is well defined and is given by log/c — I(x,X). In other words, the total number of fc-coloring 
achieving the quotient (x,X) is approximately exp (|\4|(log k — I(x,X))). In case (x,X) is not 
"nearly" achievable by any coloring, we have I(x,X) = oo, meaning that there are simply no 
coloring achieving the target quotient (x,X). 

While Definition [2] might be more intuitive than Definition [TJ the latter is more powerful in 
the sense that it does not require a "for all k" condition, namely it is introduced through having 
the LDP on one as opposed to infinite (for every k) sequence of probability spaces. It is also 
more powerful, in the sense that the measure (p n , a n ) completely defines the underlying graph, 
whereas this is not the case for the fc-quotients. Since the space [0, £)]( fc + 1 ) xfc being a subset of 
a Euclidian space is a much "smaller" than the space of measures Ai d , it is not too surprising 
that the LD-convergence implies the /c-LD-convergence. But it is perhaps surprising that the 
converse is true. 

Theorem 1. A graph sequence G n is LD-convergent if and only if it is k-LD-convergent for 
every k. 

Remark 1. Given the rate function I for LD-convergence, the rate function : [0, £)]( fe+1 ) xfc — s. 
1R + U {oo} for k-LD-convergence is easy to calculate, and will be given by 

I k (x,X) = inf I(fi,p) 
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where the inf goes over all (p,p) G Ml 1 x M 2 such that 

x i+1 = p (A i}k ) , 0<i<k-l; ^ 

Xi+ij+i = p (A ijfc x A j)k ) , < i,j < k - 1, 

wife A^j. denoting the open interval A^ k = 

We will also give an explicit construction for the rate function I from the rate functions I k , 
see Corollary^ below. 

Before proving the theorem, we reformulate the notion of fc-LD-convergence in terms of 
random measures with piecewise constant densities. We need some notation. Given a positive 
integer k, let Of C [0, l] d be the open set of all points such that no coordinate is of the form 
0, 1/k, 2/k, . . . , 1. Let M k C M d be the set of measures supported on Of, and let M d C Mf be 
the set of measures in Aif. which have constant density on rectangles 

A h>k x • • • A id!k , 0<h,...,i d <k-l. 

We define a projection operator T k : M 1 x M 2 — y Ml x Ml by mapping a pair of measures 
(p, p) G A4 1 x M 2 into (pfc, pk) = T k (p, p) defined as the pair of measures in Ml x M k for which 

p k (A i>k ) = p(Ai, k ) for < i < k - 1 
// & (A i)fe x A jtk ) = p (A ijk x A jtk ) for 0<i,j,<k-l. 

Given a real valued coloring a : V n — > [0, 1] let a k = t k (o~) be the coloring a k : V n — > [k] defined 
by a k (u) = \ka(k)1 if a{u) > and a k (u) = 1 if o~{u) = 0. Note that t k (a n ) is an i.i.d. uniformly 
random coloring with colors in [k] if a n is chosen to be an i.i.d. uniformly random coloring with 
colors in [0,1]. 

Lemma 1. Let G n = (V n ,E n ), and let a n :V n ^r [0, 1] be an i.i.d. uniformly random coloring of 
V n . Then G n is k-LD- convergent if an only if T k (p(a n ) , p(a n )) obeys a LDP on Ml x M k . 

Proof. Let a n : V n — > [k] be an i.i.d uniformly random coloring of V n with colors in [k], and define 
F k : [0,D]( k+1 *) xk -> Ml x Ml as follows: given (x,X) G [0, D]( k+r > xk with x = (x h l < i < k) 
and X = (Xij,l < i,j < k), let F k ((x,X)) := (p, p) G M k x A/"| be the piece-wise constant 
measures with densities kx{ on Aj_i ^ and k 2 Xij on Aj.^ x Aj-i^, respectively. Then F k is an 
invertible continues function with continuous inverse. Thus, applying the Contraction Principle, 
{pk{v n ),Pk{v n )) = F k ((x(a n ),X(a n ))) satisfies the LDP on A^ 1 xM% if and only if (x(a n ),X(a n )) 
satisfies the LDP on [0, Dp +1 ) x& . 

To conclude the proof, we only need to show that T k (p(a n ), p(a n )) has the same distribution 
as (pk{o~ n ) , Pk{o~n)) ■ But this follows immediately from the facts that t k (a n ) has the same distri- 
bution as <7 n and that T k (p(cr n ) , p(cr n )) depends only on the total mass in the intervals A^, so 
that T k (p(a n ),p(a n )) = (p k (t k (a n )), p k (t k (a n ))). □ 

3.1.1 Proof of Theorem [T], Part 1: LD-convergence implies fc-LD-convergence 

In view of the last lemma, it seems quite intuitive that LD-convergence should imply k-LD- 
convergence, given that the set of measures M k x M k is much smaller than the set of measures 
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Ai 1 x Ai 2 . More formally, one might hope to define a continuous map from Ai 1 x Ai 2 to Ml x Af k 
(or to [0, _D]( fc+1 ) xfc ) and use the Contraction Principle to prove the desired fc-LD-convergence. 

It turns out that we can't quite do that due to point masses on points of the form i/k, 1 < i < 
k. To address this issue, we will first prove that we can restrict ourselves to Ai\ x Ai\, and then 
define a suitable continuous function from Ai\ x Ai 2 to [0, D]/ k + l ) xk to apply the Contraction 
Principle. We need the following lemma. 

Lemma 2. Suppose G n is LD-convergent with rate function I. Suppose p is such that p{{x}) > 
for some x G [0,1]. Then for every p G Ai 2 , 1(p, p) = oo. Similarly, suppose p is such that 
p({x} x [0, 1]) > or p([0, 1] x {x}) > for some x G [0, 1] . Then for every p G Ai 1 , 1(p, p) = oo. 

Proof. Suppose p is such that p({x}) = a > for some x G [0, 1]. Fix 5 < a/2. Suppose p' is 
such that d(p, p') < 5. Then, by definition of the Prokhorov metric, 

a = p({x}) < p'(x — 5, x + 5) + 5, 

implying p'(x — 5, x + 5) > a/2. Thus the event (p(a n ), p(o~ n )) G B((p,p),5) implies the event 
p(o~ n )(x — 5, x + 5) > a/2. We now estimate the probability of this event. For this event to occur 
we need to have at least a/2 fraction of values a n (u), u G V n to fall into the interval (x — 5, x + 5). 
This occurs with probability at most 

('^'Wy <2^(25Y a / 2 ^. 

i>(a/2)\V n \ ^ 1 ' 

Therefore 

limsup|Kr 1 P(pK)(2;-^2; + 5) > a/2) < log 2 + (a/2) log(25), 

n 

implying 

limlimsup {Vn^F ((p(a n ) , p(a n )) G B((p,p),5)) = -oo, 

and thus I(p,p) = oo utilizing property The proof for the case p({x} x [0,1]) > or 
p([0, 1] x {x}) > is similar. □ 

We now establish that LD-convergence implies £;-LD-convergence. To this end, we first ob- 
serve that for an i.i.d. uniformly random coloring a n : V n — > [0, 1], we have that p(o~ n ) £ M.\ and 
p(o~n) £ M.\ almost surely (as the probability of hitting one of the points i/k is zero). 

Next we claim that since (p(a n ) , p(o~ n )) satisfies the LDP in A4 1 x Ai 2 , it also does so in 
AA.\ x Ai 2 . with the same rate function. For this we will show that ^ holds when closures and 
interiors are taken with respect to the metric space Ai\ x Ai 2 k as opposed to Ai 1 x Ai 2 . Indeed, 
fix any set A C Ai\ x Ai\. Its closure in Ai\ x Ai\ is a subset of its closure in Ai 1 x Ai 2 and the 
set-theoretic difference between the two consists of measures (p, p) such that p assigns a positive 
mass to some point i/k, or p assigns a positive mass to some segment {i/k} x [0, 1] or segment 
[0, 1] x {j/k}. By Lemma [5] the large deviations rate / at these points (p, p) is infinite, so that 
the value inf I(p, p) over the closure of A in Ai\ x Ai\ and Ai 1 x Ai 2 is the same. Similarly, the 
interior of every set A C Ai\ x Ai\ is a superset of its interior in Ai 1 x Ai 2 , and again the set 



12 



theoretic difference between the two interiors consists of points with infinite rate /. This proves 
the claim. 

Consider : Ai\ x Ail — > [0, D]( k+1 > xk , where cf) k maps every pair of measures p, p into 
total measure assigned to interval A i>fc by p and assigned to rectangle A i fc x A Jifc by p, for 
i, j — 0, 1, . . . , k — 1. Namely 4>k{p,p) = (x,X) where (x,X) is given by ([9]). 

We claim that 4>k is continuous with respect to the respective metrics. Indeed, suppose 
(Pn, Hn) (p, p) in M\ x .M 2 - Since Ai d is equipped with the weak topology, then for every 
closed (open) set A C [0,1], we have limsup n p n (v4) < p(A) (liminf n p n (A) > p(A)). The 
same applies to p„,p. Setting A to be [i/fc, (i + l)/k) first and then (i/k, (i + l)/k), and using 
the fact that p n and p are supported on 0\, namely p n {i/k) = p(i/k) = for all i, we obtain 
p n (i/k, (i + l)/k) — > p(i/k, (i + l)/k). A similar argument applies to p n ,p. This proves the claim. 

Since (p k is continuous, by the Contraction Principle [DZ98], the image of (p(o~ n ) , p(a n )) under 
0^ satisfies the LDP as well. But this means precisely that we have fc-LD-convergence with rate 
function I k as defined in Remark [I] (strictly speaking, the Contraction Principle gives a rate 
function where the inf is taken over all (p,p) G Ai\ x Ail, but by Lemma [2] we can extend the 
inf to that larger set Ai 1 x At 2 without changing the value of /&). 

3.1.2 Proof of Theorem [T], Part 2: A>LD-convergence implies LD-convergence 

In order to prove that /c-LD-convergence for all k implies LD-convergence, we would like to use 
the rate functions I k : Af k x M% — > U {00} associated with the LDP for T k (p(a n ), p(cr n )) from 
Lemma[l]to construct a suitable rate function I : At 1 xAi 2 — > lR + U{oo} for the random variables 
(p(<T n ), p(o~ n )). The existence of such a rate function will follow from a suitable monotonicity 
property, see Lemma [4] and Corollary [T] below. 

Before stating Lemma [4], we state and prove a more technical lemma which we will use at 
several places in this subsection. For two pairs of measures (p, p)), (p',p') G Nl x Af k we define 

d var ((p,p)), (p',p')) = maxl max \p{A) - p'(A)| , max |p(S) - p'(5)| \. 

lAc{0,l] BC[0,1] 2 J 

Lemma 3. 

1) If(p,p) G A^^ x Ai\, then 

d((p,p),T k (p,p)) < i. (11) 
^//(p,p),(p',p')GA4 1 xA4 2 ; ^en 

d((p, p), (p', //)) < tW((p, /i)), (P, //)) < (4^ + l)d(( P , p), (p', //))■ (12) 

3) If (p, p), (p', p') G A/ 2 \ x M% k , then 

d vaT (T k (p, p)),T k (p, p')) < d var ((p, p)), (p, //))■ (13) 

Proof. 1) Set (pfc,p?c) = T fc (p, p), and let A C [0, 1]. Define A fe to be the set that is obtained by 
replacing every non-empty set A n Aj^ with the entire interval A;^. Then A G A k G A* . We 
have p(A D A i fc ) < p(A iifc ) = p fc (A iifc ) by the definition of p k . Let a = {i : A fl A, >k 7^ 0}. Then 

p(A) = ^p{A n Ai jfc ) < ^p fe (A iifc ) = p fc (A fc ) < p fc (Ai). 
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Conversely, 

Pk{A) < p k {A k ) = p{A k ) < p(A^). 

A similar argument is used for p. 

2) The lower bound follows immediately from the definitions of d and d var . To prove the 
upper bound, we first show that 

max \p(A) - p'(A)\ < {2kD + l)d(p,p'). 

To this end, we note the maximum is obtained when A is of the form A = [J iea for some a C 
{0, . . . , k — 1}. On the other hand, for sets A of this form, we have that p{A e ) < p(A)+2kep(A c ) < 
p(A) + 2keD, a bound which follows from the fact that A c = [0, 1] \ A is a union of intervals of 
length at least l/k. Choosing e = d(p, p') we then have p(A') < p(A e ) + e < p(A) + (2kD + l)e, 
which implies p'(A) — p(A) < (2kD + l)e = (2k D + l)d(p,p'). Exchanging the roles of p and p' 
gives p(A) — p'(A) < (2kD + l)d(p,p'), which completes the claim. In a similar way, one proves 
that 

B max 2 \p(B) - p\B)\ < (AkD + l)d(p,p). 

The proof uses that for sets B which are unions of sets of the form Aj^ x Aj^ we have p(B e ) < 
p(B) + AkeD. 

3) Let (p k ,Pk) = T k (p,p) and (p' k , p' k ) = T k (p',p'). We need to show that d var (p k ,p' k ) < 
dvari.Pi p') an d d var (p k , p' k ) < d var (p,p'). The first bound follows from the definition of T k and 
the fact that the maximum in the definition d var (p k) p' k ) = maxAc[o,i] \Pk(A) — p' k (A)\ is obtained 
when A is of the form A = [J i( z a A^j. for some a C {0, . . . , k— 1}. Indeed, for such an A, one easily 
shows that \pk(A) — p' k (A)\ = \p(A) — p'(A)\, which in turn implies that d var {pki p' k ) — d V ar(p,p')- 
The proof of the bound d var (p k , p! k ) < d var (p, p!) is similar. □ 

Lemma 4. Suppose G n is k-LD-convergent, and let I k : Ml x H k — > U {oo} be the rate 
function associated with the LDP for T k (p(a n ) , p(o~ n )) . Then I k (T k (p, p)) < I 2k (p, p) for every 
(p,p)eMZ k xMi k . 



Proof. Set (p k , p k ) = T k (p,p) } fix e > 0, and set e' = (4kD + l)e. By the bounds (12) and (13), 
we have that 

(Pk, Pk) = T k{p, P) e B(T k (p, p), e) (14) 

whenever (p',p r ) G B((p,p),e). On the other hand, T k (p(a n ), p(a n )) = T k (T 2k (p(a n ), p(a n ))). 
By (14), this implies that 

f(T k (p(a n ),p(a n )) e B (r k {p,p), (4fc£) + l)e)) > F(r 2k {p(a n ), p(a n )) e B (jjj, p), ejj . 

Applying (|6| we obtain the claim of the lemma. □ 

Corollary 1. Suppose G n is k-LD-convergent for all k, let I k : A/^ 1 x M k — > 1R+ U {oo} be the 
rate function associated with the LDP for T k (p(a n ) , p(o~ n )) , let (p, p) G Ai 1 x M 2 , and let B k be 
the ball B((p, p),2k- 1 )n A^ 1 x A" fe 2 . Then the limit 

I(p,p) = lim inf I 2 t(p',p') 

exists. 
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Proof. Let k G Z+. By Lemma[3j we have that T k (p', p') G 5fc whenever (p', p') G Safe. Combined 
with Lemma [4| this proves that 

inf hk(p',fJ>')> inf h{T k {p' , p')) > inf I k {T k (p", //)). 

The claim now follows by monotonicity. □ 

We are now ready to prove that fc-LD-convergence for all implies LD-convergence. Let 
o"« . : Ki — ► [0, 1] be an i.i.d. uniformly random coloring of V n . Recalling Lemma [TJ we will 
assume that T k (p(a n ) , p(cr n )) obeys a LDP with some rate function I k . We will want to prove 
that (p(cr ra ), p(cr n )) obeys a LDP with the rate function I defined in Corollary [T] 

Fix (p, p) G M 1 x A/1 2 . By the Lemma [3j the fact that (p(a n ) , ji(a n )) G M\ x A/f 2 , almost 
surely, and the triangle inequality, we have that 

liminf \Vn\~ 1 logP ((p(cx n ), p(a n )) G B((p, p), 3AT 1 ) 

n 

> liminf IKI" 1 log F(T k (p(a n ),p(a n )) G B((p, p), 2AT 1 )) 

> - inf 4(p',p), 

where the infimum is taken over (p', p') G B((p, p), 2&T 1 ) fl A/jJ x Af k . Similarly, 
limsup|Kr 1 logP((p(a n ),p(a n )) G ^((p^Ar 1 )) 

n 

< limsuplKr'logP^^pK)^^)) G B^p.p)^,- 1 )) 

n 

< - inf 4(p',p'), 

where the infimum is again taken over (p', p') G -B((p, p), 2/c" 1 ) fl A/^ x A/" 2 . 

To complete the proof we apply Proposition [T] in conjunction with Corollary [TJ yielding that 
(p(o~ n ), p(o~ n )) obeys a LDP with rate function /. 

3.2 Basic properties and examples 

We begin by showing that the definition of LD-convergence is robust with respect to the equiv- 
alency relationship ~ on graphs. 

Theorem 2. If G n is LD-convergent and G n ~ G n , then G n is also LD- convergent. 

Proof. We apply Theorem [l] and show that if G n is fc-LD-convergent then G n is also fc- LD- 
convergent. Fix any k, any (x,X) G [0, _D]( fc+1 ) xfc and e > 0. We denote by P the probability 
measure associated with (x(a n ), X(a n )) = G n /a n when a n : V n — > [k] is a random uniformly 
chosen map on V n . The definition of ~ implies that for all sufficiently large n 



F((x(a n ),X(a n )) G B ((x, X), e/2) J < F((x(a n ), X(a n )) G B ((x,X), e) 
implying 

limlimsup|Kr 1 logP((£(a n ),A > (a n )) G B {{x, X), e/2) ) 

< limlimsup IK)" 1 logPf (x(a n ), X{a n )) G B ((x, X), e) ) . 



£->0 
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Similarly, we establish that 



>0 n 



lim lim inf | V n | " 1 log P ( (x (cr n ) , X(cr n ) ) G B ( (z, X) , e) 



>limliminf lo g p((x(a n ), X(<x n )) G £ ((x, X), e/2) ) . 

Since G n is fc-LD- convergent, then from (p| we have 



lim lim sup | V n | 1 logP(-) = limliminf \V n \ 1 logP(-). 

Thus the same identity applies to P. Applying Proposition[l]we conclude that P is LD-convergent. 

□ 

Let us give some examples of LD-convergent graph sequences. As our first example, consider 
a sequence of graphs which consists of a disjoint union of copies of a fixed graph. 

Example 1. Let Go be a fixed graph and let G n be a disjoint union of n copies of Go- Then G n 
is LD-convergent. 

Proof. We use Theorem [T] and prove that G n is &;-LD-convergent for every k. Fix k, let be 
the space of all possible fc-colorings of Go, and let M = k^ v ^ Go ^ denote the size of Every 
fc-coloring a n : V n — » [k] of the nodes of G n can then be encoded as a n = (c")i<j< n , where 
cr" G S fe is a /c-coloring of the z-th copy of G i n G n . Consider the M-dimensional simplex Sm, 
i.e., the set of vectors (zi, . . . , zm) such that J2i<m z * = z i — 0- ^ or ever y m — 1, • • • , M and 
a n = {&i)i<i<m let z m {a n ) be the number of <r™-s which are equal to the m-element of divided 
by n, and let z(a n ) = (z m (a n ))i< m <M £ Sm- By Sanoff's Theorem, the sequence z(a n ),n > 1 
satisfies the LDP with respect to the metric space Sm- Now consider a natural mapping from Sm 
into [0, _D]( fc + 1 ) xfc ? where each z = (z m )i< m <M is mapped into (x,X) G [0, ZJp +1 ) xfc as follows. 
For each element of encoded by some m < M and each % < k, let x(i,m) be the number 
of nodes of Go colored i according to m, divided by |V(Go)|. In particular J2i x (h m ) = 1- 
Similarly, let X(i,j,m) be 2 times the number of edges in Go with end node colors % and j 
according to m, divided by |y(G )|. In particular ^2 i jX(i,j,Tn) = 2\E(G )\/\V(Gq)\. Setting 
x i = Y, m x (h m ) z m,X id = Y Jm x (hj, m ) z m, this defines the mapping S M into [0, D] (fe+1)xfc . 
This mapping is continuous. Observe that the composition a n —> z(a n ) — > (x,X) is precisely 
the construction of the factor graph F = G n /a n via (|3]). Applying the Contraction Principle, we 
obtain LD-convergence of the sequence G n . □ 

Perhaps a more interesting example is the case of subgraphs of the <i-dimensional lattice Z d , 
more precisely the case of graph sequences L djn with vertex sets V^ n = { — n, — n+1, . . . , n} d . It is 
known that the free energy of statistical mechanics systems on these graphs has a limit |Sim93] , 
|Geo88j . In our terminology this means that the sequence (Ld, n ) is right- convergent (see Sec- 
tion [4]). Here we show that this sequence is also LD-convergent. 

Example 2. Let d > 1, let V^ n = {—n, —n + 1, . . . , n} d , let E n ^ be the set of pairs {x, y} C V^ n 
of £1 distance \, and let L^ n = (Vd, n ,E^ n ). Then the sequence (L^ n ) is LD-convergent. 
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Proof. We again apply Theorem [T] and establish fc-LD-convergence for every k. Fix (x, X) G 
[ 0)J D]( fe+1 ) xfc and let 



logP((x(a n ),XK)) G B((x,X),5) 

Ik{ x ) — — limlimsup 



s ~*° n Wd,n\ 

where a n : V d ^ n — > [k] is chosen uniformly at random. The limit lim^o exists by monotonicity. 
Fix an arbitrary 5 > 0. Then again by monotonicity 



logP((x(a n ),XK)) G B((x,X),5/2) 
limsup — — j > -I k (x,X), 



and we can find n so that 

logP((a;K ),X(a no )) G B((x, X), 5/2)) 

7Z~, ! > —Ik 



\v. 



d,no I 



k (x,X)-5. (15) 



Consider arbitrary n > n , and set q = L^+lJ? M = q d and m = (2n + l) d — M(2n Q + l) d . 
The set of vertices V d>n can then be written as a disjoint union of M copies of V d;riQ and m sets 
consisting of one node each. Let Hi, . . . , Mm be the induced subgraphs on the copies of V d}rLQ , 
and let L d>n = (Vd,n, E djTl ) be the union of Hi, . . . , Hm- Then 

\V d , n \ = \Vd,n\ -m= \V d>n \(l -O(n /n)) 

and 

\E dtn \ = \E djH \ - OiMni- 1 ) - 0{m) = \E d>n \ - 0(n% r ) - Ofa^no). 

A uniformly random coloring o~ n : V d . n — > [k] then induces a uniformly random coloring 
5"« : Vd,n [k]. Due to the above bounds, the corresponding quotients L d ^ n /a n = (x(a n ),X(a n )) 
and L dyn /a n = (x(a n ), X(a n )) are close to each other. More precisely, 

d((x(a n ),X(a n )), (x(a n ),X(a n ))) = O(n /n) + O(l/n ). 



Then we can find no large enough so that (15) still holds, such that for all large enough n, 

d[{x{a n ),X{a n )), {x{a n ),X{a n )j) < 5/2. 

Then 

¥((x(a n ),X(a n ) G B((x,X),5))) > P ((x(a n ),X(a n )) G B((x,X),5/2)) . 

On the other hand, since X^j{a n ) are counted over a disjoint union of graphs identical to L djYl0 , 
then if a n is such that within each graph H r we have d((x,X), (x(a n ), X{a n ))) < 5/2, then the 
same applies to the overall graph L djn . Namely, 

M 

F[(x(a n ),X(a n )) G B(x,5/2)) > (P((x(<T no ),X(cr no )) G B(x } X } 5/2)) 
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where the second expectation is with respect to uniformly random coloring of L^ no - Since 
M < \V djn \/\V djno \, we obtain 



1 



logP(xK),XK)) G B{x,8)) > -J—\ogF((x(a no ),X(a no )) E B{x,X,8/2) 



I Vd,n | 

>-I k (x,X)-6, 



where the second inequality follows from (15). Since this holds for all large enough n, then from 
the bound above we obtain 



limliminf -——\og¥((x(cr n ),X(a n )) e B((x,X),5)) > -I k (x,X) 



We conclude that the relation (|6j) holds. 



□ 



4 Other notions of convergence and comparison with LD- 
convergence 

In this section we consider other types of convergence discussed in the earlier literature and 
compare them with LD-convergence. Later we will show that every other mode of convergence, 
except for colored-neighborhood-convergence, is implied by LD-convergence. We begin with the 
notion of left-convergence. 

4.1 Left-convergence 

We start with the definition of left-convergence as introduced in |BCKL13] : 

Definition 3. A sequence of graphs G n with uniformly bounded degrees is left- convergent if the 
limit 

lim |Kr 1 hom(e,G„) (16) 

exists for every connected, finite graph H. 

As already noted in |BCKL13] . this notion is equivalent to the notion of Benjamini-Schramm 
convergence of the sequence G n = (V n , E n ). To define the latter, consider a fixed positive integer 
r, and a rooted graph EI of diameter at most r. Define p„(H, r) to be the probability that the 
r-ball around a vertex U chosen uniformly at random from V n is isomorphic to H. The sequence 
is called Benjamini-Schramm convergent if the limit p(M.,r) = lim n p n (H, r) exists for all r and 
all routed graphs EI of diameter r or less. 

Recall our definition of relation ~ between graph sequences. It is immediate that if G n is 
left-convergent and G n ~ G n then G n is also left-convergent with the same set of limits p(H, r). 
Thus left-convergence can be defined on the equivalency classes of graph sequences. 
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4.2 Right-convergence 

For sequences of bounded degree graphs, the notion of right-convergence for a fixed target graph 
EI was defined in [BCKL13] . In this paper, we define right-convergence without referring to 
a particular target graph, and require instead the existence of limits for all soft-core graphs 
H. Recall that for a weighted graph EI with vertex and edge weights given by the vector a = 
(cki(H), . . . , afc(Hf)) and the matrix A = (Ay (H))i<ij<&, respectively, the homomorphism number 
hom(G, H) is given by Q. Without loss of generality we assume Q!i(H) > for all i. We say 
that EI is soft-core if also Ay(H) > for all i, j. 

To motivate our definition of right-convergence, let us note that for a soft-core graph EI on k 
nodes, 

< o^ lE " 1 < fc-^hom(G n ,H) < < offi 1 )™ 

where 

a max = max ( 1 , max a j , max Ay ) and a mm = min ( 1 , min a j , min Ay ) . (17) 

Thus log hom(G n , EI) grows linearly with \V n \. We will define a sequence to be right-convergent 
if the coefficient of proportionality converges for all soft-core graphs. 

Definition 4. A graph sequence G n is defined to be right- convergent if for every soft-core graph 
M, the limit 

a ,._ loghom(G n ,EI) 



-/(H) = lim 



\Vn\ 



exists. 



In the language of statistical physics, the quantity hom(G, EI) is usually called the partition 
function of the soft-core model with interaction EI on G, and the quantity /(EI) is called its free 
energy Sometimes we will write /(g„)(H) instead of /(EI) to emphasize the dependence on the 
underlying graph sequence G n . 

Observe that our definition is robust with respect to the equivalency notion ~ on graph 



sequences. Namely if G n ~ G n , then the limits (18) are the same. Indeed, deleting or adding 
one edge in G n increases the partition function by a factor at most max Ay and decreases it by 
a factor at most mm Aij. Thus adding/deleting o(| V^|) edges changes the partition function of a 
soft-core model by a factor of exp(o(|V^|). 

By contrast, the partition function of a hard-core model can be quite sensitive to adding or 
deleting edges, as we already demonstrated in the introduction, where we discussed the case of 
cycles. The same observation can be extended to the case when G n is a ci-dimensional cylinder or 
torus, again parity of n determining the two-colorability property, see |BCKL13] for details. An 
even more trivial example is the following: let G„ be a graph on n isolated node (no edges) when 
n is even, and n — 2 isolated nodes plus one edge when n is odd. Then there exists 1-coloring 
of G n if and only if n is even, again leading to the conclusion that G n is not converging on all 
simple graphs EI. 

We adapted Definition [4] to avoid these pathological examples. Nevertheless, we still would 



like to be able to define the limits (18) for hard-core graphs EI, and do it in a way which is 



insensitive to changing from G n to some equivalent sequence G n ~ G n . There are two ways to 
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achieve this which are equivalent, as we establish below. Given A > and a (not necessarily 
soft-core graph) EI with edge weights given by a matrix A = (Ajj(H)), let A\ be the matrix with 
positive entries 

{A x )ij = max{A,Aj(H)} 
and let Ha be the corresponding soft-core graph. 

Theorem 3. Given a right-convergent sequence G n and a weighted graph M, the following limit 
exists 

/ (Gn) (H)^lim/ (Gre) (H A ). (19) 

A — >-(J 

Furthermore, there exists a graph sequence G n ~ G n such that for every graph HI 

logkom(G n ,M) logkom(G w ,M) 
-/(G„)P) =hm j— j > sup km sup — , (20) 

n \ V n\ G„~G„ n \ V n\ 

where the supremum is over all graphs sequences G n which are equivalent to G n . In particular, 
the maximizing graph sequence G n exists and can be chosen in such a way that the km sup is 
actually a limit. 

Tke intuition of tke definition above is tkat we define /( ( g ri )(EI) by sligktly softening tke "non- 
edge" requirement of zero elements of tke weigkt matrix A. Tkis turns out to be equivalent to 
tke possibility of removing o(|V^|) edges in tke underlying grapk in trying to ackieve tke largest 
possible limit witkin tke equivalency classes of grapk sequences. One can furtker give a statistical 
pkysics interpretation of tkis definition as defining tke free energy at zero temperature as a limit 
of free energies at positive temperatures. 



Proof. Tke existence of tke limit ( 19 ) follows immediately by monotonicity: note tkat kom(G, HI) 
is monotonically non-decreasing in every element of tke weigkt matrix A. 

Tke proof of tke second part is more involved, even tkougk tke intuition bekind it is again 
simple: First, one easily skows tkat monotonicity in A and tke fact tkat tke limit (18) is not 
ckanged wken we ckange o(|V„|) edges implies tkat for every G n ~ G n 

logkom(G n ,H) 

km sup — < -/ (Gw) (H). (21) 

n | v n \ 

To prove tkat /(c n )(H) is ackieved asymptotically by some G n ~ G n we may assume witkout loss 
of generality tkat /(c n )(HI) < oo, since otkerwise tke identity kolds trivially. Consider a "typical 
configuration" o : V n — > V(W\) contributing to bom(G n ,HI A ), and let E (a) be tke set of edges 
{u,v} e E n suck tkat A a{u)jCT(v) (B.) = 0, 

E (a) = {(u,v) e E n : A a[u)>a{v) (M) = 0}. (22) 

For small A, we expect tke size of tkis set to grow slowly witk \ V n \, since otkerwise kom(G n , HI^) 
would become too small to be consistent witk lim^o /( (Gn )(HI A ) < oo. We migkt tkerefore kope 
tkat we can find a subset E 0tn C E n suck tkat (i) \E 0>n \ = o(\V n \) and (ii) removing tke set 
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of edges E 0n from E n leads to a graph sequence G n such that \V n \ 1 loghom(G n , EI) is close to 
-/(G*)(H).' 

It will require a little bit of work to make this rather vague argument precise. Before doing 
so, let us give the proof of (21), which is much simpler: Indeed, by monotonicity, for for every 
A > we have 

loghom(G n ,H) hom(G n ,H A ) 
lim sup : — : < hm sup 



log hom(G n ,EI A ) 



lim sup 

i 

lim 



|K| 

loghom(G n ,EI A ) 



= -f(G n )(M\), 

where the first equality follows since EI A is a soft-core graph. Passing to the limit A — > 0, we 
obtain (21), which in turn implies 

loghom(G n ,H) 
sup hmsup — < -/ (Gb) (M). 

<G„~G„ ™ \ V n\ 

We now show that /(c n )(H) is achieved asymptotically by some G n ~ G n . Our main technical 
result leading to the claim is as follows. 

Lemma 5. For every e G (0, 1), every graph H, and for all large enough n, there exists a graph 
G n which is obtained from G n by deleting at most e\V n \ edges such that 

hom(G n ,H) > exp (-(/ (G „)(H) + e)\V n \) . (23) 

We first show that this lemma implies the required claim. It relies on a standard diagonaliza- 
tion argument. Note that it suffices to establish the result for graphs EI with rational (possibly 
zero) weights. Let EI^E^, . . . be an arbitrary enumeration of graphs with rational weights. For 
every m = 1, 2, . . . , we find large enough n(m) such that for n > n(m), the claim in lemma holds 
for e = 1/m 2 and for all EI = Hi, ... , EI m . Namely, for every n > n(m), graphs G n ,i, ■ ■ ■ , G nm 
can be obtained from the graph G n each be deleting at most n/m 2 of edges in G n such that (23) 
holds for e = 1/m 2 . Without loss of generality we may assume that n(m) is strictly increasing 
in m. Let E n ^ be the set of edges deleted from G n to obtain G nji for 1 < i < m and G™ be the 
graph obtained from G n by deleting the edges in Ui<j< m i? nji . The number of deleted edges is at 
most m(n/m 2 ) = n/m. By monotonicity, (23) implies that 

hom(G™,e,,) > exp (-(/ (Gn) (H,) + l/m 2 )\V n \) , i = 1,2, . . . ,m. 

We now construct the graph sequence G n . For all n < n(l) we simply set G n = G n . For all other 
n, we find a unique m n such that n{m n ) < n < n(m n + 1) and set G n = G™ n . Then G n ~ G n 
since the number of deleted edges is at most \V n \/m n = o(\V n \) as n — > oo. For this sequence we 
have 

IKI-Moghom^,^) > -/ (Ga )(Hi) - 1/ml, % = 1, 2, . . . , m n . 



21 



Fixing i and taking liminf n of each side we obtain for each i 

liminf iKr^oghomC^Hi) > -/(&») (Hi). 



Combining with (21) we conclude that for each i 

limlKl-Moghom^,^) = — /(g )(Hj). 

n 

This completes the proof of the theorem. □ 

Proof of Lemma\5^ For the ease of exposition we write /(H) for /(g n )(H). If /(H) = oo, there is 
nothing to prove, so we assume /(H) < oo. 
Let 

Ot\{(r) = Y[ a <r(u) I I m ax(4(u),(7(B)iA) 

ueVn (u,v)£E„ 

and let £(-E) be the set of a : V n — > [k] such that Eo(a) = E. Then 

hom(G„,H A ) = Yl 

EcE n ff 6S(B) 

We will prove that for A sufficiently small and n sufficiently large, we can find a set E C E n 
such that \E \ < e\V n \ and 

«A(a)>exp(-(/(e) + e)|K|). (24) 

^eE(Bo) 

Note that this bound immediately implies the claim of the lemma. Indeed, let G n be the graph 
obtained from G n by deleting the edges in E and assume with out loss of generality that A is 
smaller than the smallest non-zero entry of A. Then every a G S(i?o) satisfies a\(cr) = A' E °'<5((j), 
where a is the weight with respect to the graph G n , vector a and the matrix A. This implies 

hom(G n ,H)> a(ff) = A- |Eo1 ^ a x (a) 

> «A(^)>exp(-(/(H) + e)|K|), 



as required. The proof of the lemma is therefore reduced to the proof of (24). 
Given < e < 1 we chose < A < e such that 

3A + AlogD + Alog(l/A)- 1 < e, (25) 

and < A < A such that 

/(M) + 2 + (£> + !) log cw + logfc ~ 
log(A x ) 
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Here we note that —/(H) < log A; + (D + 1) loga max , and thus the numerator in the left-hand 
side is positive. Let n = n (X) be large enough so that 

log(D\V n \) < X\V n \ and hom(G n ,H A ) > exp((-/(H A ) - A)|V n |) 

for all n > Wo(A). Note that by monotonicity in A, the second bound implies that 

hom(G n ,H A ) >exp((-/(H)-A)|K|). (27) 

Fix an arbitrary such n > n , and let E(r) be the set of mappings a : V n — > V(H) such that 
precisely r edges of are mapped into non-edges of H, 

S(r) = {a:K^V(H): |%)hr}. 

For all A which are smaller than the smallest positive element of A we have 

hom(G n ,H A ) = £ ^(^)- 

0<r<|£„| creS(r) 

Let ro be such that 

oc\(<r) > | J E„r 1 hom(G n ,e A ). (28) 

aeS(r ) 

We claim that r < A|V^|. Indeed, since for every a G S(r ) we have a A (a) < A ro a!JH + '' En ' < 
A^a^ 1 ^" 1 , it follows from @ that 

fc'^'A^a^l^l > |Kr 1 hom(G n ,e A ) >exp((-/(H)-2A)|K|), 



where the second inequality follows from (27) and the fact that for n > no, we have \E n \ < 
D\V n \ < e A ' yn L Rearranging, we get that 

r < (log(l/A))- 1 (|K|(/(H) + 2A + (D + 1) logcw + log A:)). 



Applying ( 26 ) , this in turn implies that 



r < (log A- 1 )- 1 !^! (/(H) + 2A + (D + 1) log« max + log k) 

< (log A- 1 )- 1 !^! (/(H) + 2 + (D + 1) loga max + log A;) 

< X\V n \, (29) 



as claimed. 

Next we observe that 



Y = Y Y ax ^- 
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Since the number of subsets of the edges of G n with cardinality r is ('f™'), we can find a subset 
Eq C E n , \E \ = r such that 



o-eS(_E ) 



E n 
r 



o-eS(ro) 



-1 



> I lE r n Q l ) exp((-/(H)-2A)|K|) 



where the second inequality follows from (28). Next, we note that f, ) < 



(30) 



J , a fact which 



can easily be proved by induction on k. Combining this bound with the fact that \E n \ < D\V n \ 



and the bounds (29) and (25), we get 

\E n \\ < (D\V n \\ < (D\V n \\ < /eD 



A[V n | 



,((l+logD)A+Alog(A- 1 ))|K I | < e (e-2A)[V„| 



Combined with (30) this implies the bound (24) 



□ 



4.3 Partition-convergence and colored-neighborhood-convergence 

The notions of partition-convergence and colored-neighborhood-convergence were introduced in 
Bollobas and Riordan [BRlfj. Recall the notion of a graph quotient F = G/a defined for every 
graph G and mapping a : V(G) — > [m]. Then F is a weighted graph on m nodes with node 
and edge weights {x(a), X(a)) G [o,D]( m+1 ) xm defined by (g). When G is the n-th element of 
a graph sequence G n , we will use notations x n (a) and X n (a). Let E n (m) be the set of all pairs 
(x„(<j), X n (a)), when we vary over all maps a. As such E n (m) G V ([0, £)]( m + 1 ) xm j ) where P(^4) 
is the set of closed subsets of A. For every set A C [0, £)]( m + 1 ) xm ; j e t A e be the set of points in 
[0, _D]( m + 1 ) xm with distance at most e to A with respect to the L m norm on [0, £)]( m + 1 ) xm (the 
actual choice of metric is not relevant). Define a metric p m on closed sets in V ([0, /)]( m + 1 ) xm j 
via 



p(i,B)=inf{5cA £ ,Ac5 e }. (31) 

Definition 5. ^4 graph sequence G n is partition- convergent if for every m, the sequence E ra (m), n > 
1 is a Cauchy sequence in the metric space V([0, £)]( m + 1 ) xm ). 

It is known that V([0, £)]( m + 1 ) xm ) i s a compact (thus closed) metric space, which implies that 
S ra (m) has a limit E(m) G "P([0, _D]v m+1 ) xm ). Loosely speaking E(m) describes the limiting set 
of graph quotients achievable through the coloring of nodes by m different colors. Thus, if for 
example some pair (x,X) belong to E(m), this means that there is a sequence of partitions of 
G n into m colors such that the normalized number of nodes colored i converges to Xi for each i, 
and the normalized number of edges between colors i and j converges to for each i, j. 

The definition above raises the question whether partition-convergence implies convergence 
of neighborhoods, namely left-convergence. We will show in the next section that this is not 
the case: partition-convergence does not imply left-convergence (whereas, as we mentioned 
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above, right-convergence does imply the left-convergence). In order to address this issue Bol- 
lobas and Riordan extended their definition of partition-convergence |BRllj to a richer notion 
called colored-neighborhood-convergence. This notion was further studied by Hatami, Lovasz and 
Szegedi [HLS12J under the name local-global convergence, to stress the fact that this notion cap- 
tures both global and local properties of the sequence G n . We prefer the original name, given 
that both right and LD-convergence capture local and global properties as well. 

To define colored-neighborhood-convergence, we fix positive integers m and r. Let l-i m ,r C % 
denote the finite set of all rooted colored graphs with radius at most r and degree at most A, 
colored using colors 1, . . . , m. For every graph G with degree < A and every node u G V(G), every 
coloring a : V(G) — > [m] produces an element EI G "H m , r , which is the colored r-neighborhood 
Bq(u, r) of u in G. For each EI G "H m , r , let N(G, m, r, a, EI) be the number of nodes u such that 
there exists a color matching isomorphism from Biq(u, r) to EI. Namely, it is the number of times 
that EI appears as a colored neighborhood B G (u,r) when we vary u. Let 7r(G, m, r, a) be the 
vector (\V(G)\~ 1 N(G, m, r, a, EI), EI G "H m , r ). Namely, it is the vector of frequencies of observing 
graphs EI, as neighborhoods BQ(u,r) when we vary u. Then for every m, we obtain a finite 
and therefore closed set %(G,m,r) C [0, l]l' Wm > r l when we vary over all m-colorings a in vectors 
ir(G,m,r,a). As such, 7r(G,m, r) G "P([0, l] |Wm ' r| ). We consider "P([0, l] |Wm > r| ) as a metric space 



with metric given by (31). 



Definition 6. A graph sequence G n is defined to be colored-neighborhood-convergent if the se- 
quence ii(G n ,m,r) is convergent in P([0, l]l^ m > r l) for every m,r. 

It is not too hard to see that colored-neighborhood-convergence implies partition-convergence 
and left-convergence. As we show in the next section, the converse does not hold for either 
partition- or left-convergence. 



5 Relationship between different notions of convergence 

In this section we explore the relations between different notions of convergence. As we will see, 
most of the definitions are non-equivalent and many of them are not comparable in strength. 
Our findings are summarized in the following theorem. 

Theorem 4. The notions of left, right, partition, colored-neighborhood and LD-convergence sat- 
isfy the relations described on Figures [3] and 01 where the solid arrow between A and B means 
type A convergence implies type B convergence, and the striped arrow between A and B means 
type A convergence does not imply type B convergence. 

We have deliberately described the relations in two figures, since at this stage we do not know 
how LD and colored-neighborhood-convergence are related to each other. As a step towards this it 
would be interesting to see whether colored-neighborhood-convergence implies right- convergence. 
We will discuss this question along with some other open questions in Section [6j 

Note that it was already proved in [BCKL13J that right-convergence implies left-convergence, 
and that it is obvious that colored-neighborhood-convergence implies partition and left-conver- 
gence. To prove the Theorem, it will therefore be enough to show that LD-convergence implies 
both right and partition-convergence, that left-convergence does not imply right-convergence, 
that partition-convergence does not imply left-convergence, and that right-convergence does not 
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LD-convergence 




partition-convergence 



Figure 3: Implications between LD and other notions of convergence 

imply partition-convergence (all other arrows are implied). We will start with the negative 
implications, which are all proved by giving counter-examples. 

5.1 Left-convergence does not imply right-convergence 

As discussed earlier left-convergence does not imply right-convergence on all hard-core graphs 
H. But our modified definition of right-convergence involving only soft-core graphs raises the 
question whether with our new definition, left-convergence does imply right-convergence. It 
turns out that this is not the case since right-convergence implies convergence of certain global 
properties like the convergence of the maximal cut in a graph, a property which is not captured 
by the notion of left-convergence. 

Indeed, let G n be right-convergent, let MaxCut(G n ) denotes the size of the maximal cut in 
G n , and let EI be a graph consisting of a single edge with weights a = (1, 1) and A 1<2 = A 2> i = 
eP,A 1A = A 2)2 = 1. Then 

e /3MaxCut(G n ) < hom ( Gn; e) < 2™e /3MaxCut(G " ) . 

Sending (3 — > oo, we see that right-convergence implies convergence of ^ MaxCut(G ra ). 

As an immediate consequence, we obtain that left-convergence does not implies right-conver- 
gence. 

Example 3 ([BC KLT3] ). Let A be a positive integer, let Q(n, A) denote a random A-regular 
graph on n nodes, and let B n ^ denote a random A-regular bi-partite graph on n nodes. Let G n 
be equal to G(n, A) when n is odd and equal to B n ^ when n is even. Then G n is left- convergent 
for all A, but for A sufficiently large, it is not right- convergent. 
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colored neighborhood-convergence 




partition-convergence 



Figure 4: Implications between colored-neighborhood and other notions of convergence 



Proof. G n is left-convergent since for every r, with high probability B(U, r) is a A-Kelly (regular) 
tree truncated at depth r, where U is a random uniformly chosen node in G n . We now show that 
G n is not right-convergent for large A. Indeed, MaxCut(G n ) = MaxCut(-B nj A) = for even 
n, while MaxCut(G„) = MaxCut(G(n, A)) = \An + 0(^~Kn) for odd n |BCP97| . This implies 
that for large A, the sequence G n is not right- convergent. □ 

5.2 Right-convergence does not imply partition-convergence 

In order to construct our counter example, we need the following lemma. We recall that the edge 
expansion constant of a sequence of graphs G n is the largest constant 7 such that the number of 
edges between W and V(G n ) \ W is larger than i\W\ whenever \W\ < |V(G n )|/2. 

Lemma 6. Let A > 3. Then there exists a 7 > and a right- convergent sequence of expanders 
G n with edge expansion constant 7 and maximal degree bounded by A. 

Proof. Consider an arbitrary sequence G n = (V n , E n ) of expanders with edge expansion 7 > 
and maximal degree at most A, e.g., a sequence or 3-regular random graphs. Construct a 
right-convergent subsequence of this sequence by the following, standard diagonalization argu- 
ment: enumerate the countable collection of weighted graphs with rational positive weights: 
Hi, H2, ... . For each m = 1,2, ... , construct a nested subsequences C r^™ -1 ), such that 
G n is right-convergent with respect Hi, . . . ,H m along the subsequence n^ m \ and then take the 
diagonal subsequence nffl - the m-element of the m-th sequence. The constructed sequence of 
graphs is right-convergent with respect to every weighted graph H with rational coefficients. A 
straightforward argument shows that then the sequence is right-convergent with respect to all 
graphs with positive real weights, namely it is right- convergent. □ 
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Example 4. Given a right- convergent sequence of expanders G n with bounded maximal degree 
and edge expansion 7 > 0, let G n = G m if n = 2m is even, and let G n be a disjoint union of two 
copies of G m if n = 2m + 1 is odd. Then (G n ) is right- convergent but not partition-convergent. 

Proof. Let G n = (V n , E n ). For every weighted graph H, 

|1/ 2n+1 r 1 logZ(G 2n+1 ,H) = (2|K|)" 1 (21ogZ(G n ,H)) 

= iKl^log^G^H) 
= \V 2n \- l \ogZ{G 2n ,W). 

Since G n is right-convergent, G n is right-convergent as well. 

On the other hand, we claim that G n is not partition-convergent, since every cut of G n = 
G m into two equal parts has size at least 7|\4|/2 if n is even, while G n has a cut into equal 
parts with zero edges when n is odd. Indeed, denoting the node set of G n by V n , consider the 
set £„(2) of pairs (x n (a), X n [a)) of vectors x n (a) = (x"(cr), x% (cr)) and matrices X n (a, 2) = 
PQj(o-), 1 < i, j < 2) when we vary over a : V n — > [2]. Consider the projection of S n (2) onto 
[0,D] 3 corresponding to (arj(<r), x^(a),X^ 2 (a)). Consider the point a = (1/2,1/2,0) G [0,D] 3 
Observe that for each n, there exists a a such that (a^ n+1 (cr), xl n+1 (a), Xl n 2 +1 (a)) = a, since we 
can split the graph G2 n +i into two parts with the same number of nodes in each part and no 
edges in between the parts. On the other hand for each graph G 2n and each a, the distance 
between (xf n (a), x| n (cr), Xf^(cr)) and a is bounded from below my min{l/4, 7/4} (indeed, if 
\x1 n (a) — 1/2| < 1/4, then by the expansion property of G n , Xf^cr) > 7/4). This shows that 
the sets {(xi(a), x 2 (cr), X" 2 (<r))} obtained by varying cr are not converging as n — > 00 and thus 
the sequence G n is not partition-convergent. □ 



5.3 Partition-convergence does not imply left-convergence 

We now exhibit an example of a graph sequence which is partition-convergent but not left- 
convergent. 

Example 5. Let G n be a disjoint union of n 4-cycles when n is even and a disjoint union of n 
6-cycles when n is odd. Then G n is partition-convergent and not left-convergent. 

Proof. G n is clearly not left-convergent. Let us show that it is partition-convergent. For this 
purpose set D = 2 and consider arbitrary k. Let S(A;) be the set of all (x,X) e [0, _D]( fc+1 ) xfc 
such that Z^i<i<k x i = 1> = f° r a ^ hj> anc ^ 

k 

y^X id = 2xi foralU. (32) 
i=i 

We will prove that E n (/c) — > S(/c) which shows that the sequence is partition-convergent with 
S(A;) being the limit. 

To this end, we first show that E n (/c) C S(/c) for all n. Indeed, let cr : V n — >■ [k]. By the 
definition (|3]), we have that 

k 

£^» = ^£ E 1 = ^E 2 = 2 ^)' 

j=i 1 n| ueVi vsv-n-- 1 n| ueVi 
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where we used that the degree of all vertices in G n is two. This proves that (32) holds for all 

(x,x)ez n (k). 

We now claim that for every element (x,X) G S(A;) and every n there exists a n : V n — > [k] 
such that the L ro distance between (x,X) and (x(a n ), X(a„)) is at most k/n. This is clearly 
enough to complete the proof, since it implies that for every e > and n large enough, we have 

c (£ n (k)y. 

To prove the claim, we first consider even n so that G n is a disjoint union of n 4-cycles. To 
this end, let n^j, i, j G [k], be such that 



n i,j 2 J 



< 1 for all i,j G [A;] 



and 
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> -J 



n. 



»je[fc] i,je[k] 

We consider the following coloring a n of G n : For each % ^ j we take rijj cycles and color their 4 
vertices alternating between color % and j as we go around these cycles, while for % = j, we take 
riii cycles and color them with only one color, i = j. For this coloring, we have 



so by our choice of Tiij we have that 

Since E n (fc) C we have 

|xj - Xi(cr)| 



X i,j ~ X i,M)\ < 



^2(Xij -Xij(a) 



< 
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completing the proof of the claim when n is even. A similar result is established for odd n, when 
G n is a disjoint union of n 6-cycles. □ 



5.4 LD-convergence implies right-convergence 

Let the sequence G n be LD-convergent. Applying Theorem [T] it is also fc-LD-convergent with 
rate functions Ik,k > 1. Consider an arbitrary /c-node weighted graph EI with strictly positive 
node weights a = (ctj, 1 < i < k) and strictly positive edge weights A = (A i; j, 1 < i, j < k). We 
define an "energy functional" £e on [0, _D]( fc+1 ) xfc by 

£ m (x,X) = - E Xilogai-^ E XijtogAij. (33) 

We also introduce an "entropy functional" Sk on [0, _D]( fc + 1 ) xfc by setting 

S k {x,X) = logk-I k {x,X). (34) 
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Theorem 5. Suppose the sequence of graphs G n is LD- convergent. Then it is also right- 
convergent and 

- lim -^logZ(G n ,H) = inf U m {x, X) - S k {x, X)) . (35) 

n-Hx> \V n \ (x,X)£{0,D](k+l)xk \ J 

Remark 2. By the lower semi- continuity of Ik, the infimum over (x, X) G [0,.Dp +1 ) xfc is at- 
tained for some {x* , X*) G [0,D]( k +V xk . The right hand side can then be rewritten as £u{x*, X*) — 
Sk{x*, X*). The theorem thus states that the limiting free energy exists, and is given as the energy 
minus entropy, an identity which is usually assumed axiomatically in traditional thermodynamics. 
Here it is a consequence of LD- convergence. 

Remark 3. By Remark^ we can express the limiting free energy directly in terms of the rate 
function I. Defining £ M (p, p) = £^(x(p), X(p)) where x(p) and X(p) are given by this gives 

- lim r— - log Z(G n , H) = inf ( £ m (p, p) + I(p, p) - log k) . 

ra-»oo \V n \ (p,ii)£M 1 xM 2 V / 

Proof of Theorem^ Fix 5 > 0. Recalling the definition ([3]), let be the set of colorings 

o '. V n — y [k] such that 

Xi — 5 < Xi(a) < Xi + 5. 

for every % = 1, . . . , k and 

Xij - 5 < X i}j (a) < X i}j + 8. 
for every i, j = 1, 2, . . . , k. Note that we have the identity 

\X(x,X)\ = k^¥ k , n (B((x,X),6)"j, 

where we recall that B((x, X), 8) denotes the closed 5-ball around (x, X) with respect to the L ro 
norm, see Subsection 3^ Note also that 

£ m (x(a),X(a)) - £ m (x,X) < K5, K = (k + —j max{log«i, log A itj }, 
whenever cr G £(x,X). 

Define T$ to be the set of pairs (x, X) G [0, D]i k+l ) xk such that every Xi and every X i: j belongs 
to the set {0,5,25, . . . , \D/5}5}. Since Xij(a) < D for all a : V n — > [k], every o : V n — > [k] lies 
in U^ Xt x)ers^(x, X). As a consequence, 

Z(G n ,M) < 22 IT a<7 ^ EI ^(u),a(v) 

(x,X)&F s a£S(x,X) u£V n (u,v)eE n 

= ^2 e~ £n{x{a) ' x(a))lVnl 

(x,x)ev s aez(x,x) 

< k\ v ^ n (B((x,X),5))e- £ ^ x > x ^ +K5 ^ 



(x,x)er s 

< \T S \ max ( k^¥ k>n (b((x, X), 5) 



e -£ B (x,X)\Vn\+K5\V n \ 
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We obtain 



^logZ(G n ,H) < ^Llog|r 5 |+logA; 



max [^-\ogF k JB((x,X),5))-£u(x,X) + Ks), 
(x,x)er 5 \ \v n \ \ / / 



and hence 



limsup-^logZ(G n ,e)-logA; 

n \V n \ 

< limsup max ( logP M (b((x, X), 5)) - £ m (x, X)) + K5 

n (x,X)eT s \ \V n \ V / / 

max ( limsup ^-log¥ k JB((x,X), 5)) - £ m (x, X)) + K5 
x,x)er s \ n \V n \ V / / 

inf I k (y,Y)+£ m (x,X)) + K5 
inf (l k (y,Y)+£ m (y,Y))) +2K5 

zB((x,X),S)\ J ) 

inf (l k (y,Y)+£ m (y,Y))) +2K5 
inf (l k (y,Y)+£ M (y,Y))+2KS. 
Since this inequality holds for arbitrary 5, we obtain 

limsup-^ log Z(G n ,H) < - inf (s m (x,X) -S k (x,X)\. 

n \V n \ (x,X)e[0,D] (k+1)xkK 1 

We now establish the matching lower bound. Fix 5 > and let (x*,X*) be the minimizer of 
£u — S k . We have 

Z(G n ,U)> e" %(l(ff),X(ff))|y "' 
o-eEO*,x*) 

A;l v '"lp fc)n ( J B(( a ;*,X*),5))e^^' x *)l v '"l-^l y "l, 



(x,. 

< max I — 

(x,x)er s V (v,y)eB(( a 

< max 

(x,X)er s \ (y,Y)eB((a 

< max I — 

(x,x)er s V ( y ,Y)e[o,D] (k+1)> 



and thus 



> 



lim inf log Z(G n , H) - log fc > 



> liminf T -^ T logP fc , n ('s((x*,X*),(J)) -£ H (x*, X*) - AT5 

> liminf-^logP A; , n ( J B ((a;*,X*),5)) - 5 H (x*, X*) - Kb 

n \v n \ v y 

>- inf I k ((y,Y))-£ m (x*,X*)-K5 

(y,Y)eB°((x*,X*),5) 

> -I k (x*,X*) - £ m (x*, X*) - K5. 

Letting 5 — > we obtain the result. □ 
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5.5 LD-convergence implies partition-convergence 

We prove this result by identifying the limiting sets £(&), k > 1. In order to do this, we recall 
that by Theorem [T] every LD-convergent sequence is k-LD-convergent for all k. We denote the 
corresponding functions by 1^. 

Theorem 6. Let G n be a sequence of graphs which is LD-convergent. Then G n is partition- 
convergent. Moreover, for each k, the sets S n (fc) converge to 

E(k) = {(x, X) G [0, Df +1)xk : I k (x, X) < oo}. (36) 

The intuition behind this result is as follows. is the set of "approximately achievable" 

partitions (x,X). If some partition is approximately realizable as factor graph G n /o~ for some 
a : V n — > [k], then there are exponentially many such realizations, since changing a small linear 
in \V n \ number of assignments of a does not change the factor graph significantly Therefore, a 
uniform random partition a has at least an inverse exponential probability of creating a factor 
graph G n /a near (x,X) and, as a result, the LD rate associated with (x,X) is positive. 

On the other hand if (x,X) is e-away from any factor graph G n /a, then the likelihood that 
a uniform random partition a results in a factor graph close to (x, X) is zero, implying that the 
LD rate associated with (x, X) is infinite. We now formalize this intuition. 



Proof. We define as in (36) and prove that it is the limit of T, n (k) as n — > oo for each k. 
We assume the contrary. This means that either there exist k and eo > such that for infinitely 
many n, S n (fc) is not a subset of (S(A;)) e °, or there exist k and eo > such that for infinitely 
many n, S(/c) is not a subset of (S ra (A;)) ,:o . 

We begin with the first assumption. Then, for infinitely many n we can find (x n , X n ) G S n (/c) 
such that (x n ,X n ) has distance at least e from £(&). Denote this subsequence by (x nr ,X nr ) 
and find any limit point (x,X) of this sequence. For notational simplicity we assume that in 
fact (x nr ,X nr ) converges to (x,X) as r — > oo (as opposed to taking a further subsequence of 
n r ). Then the distance between (x,X) and S(fc) is at least eo as well. We now show that in fact 
Ik{x,X) < oo, implying that in fact (x,X) belongs to S(/c), which is a contradiction. 

To prove the claim we fix 5 > 0. We have that L ra distance between (x nr ,X nr ) and (x, X) 
is at most S for all large enough r. Since (x nr ,X nr ) e S nr (A;), then for all r there exists some 
&n r '■ Ki r — > [k] such that (x nr ,X nr ) arises from the partition a nr . Namely, /<) 
and X nr = X(G nr /cr nr ). Since a randomly uniformly chosen map a nr : V Ur — > [k] coincides with 
b Ur with probability exactly k~ nr , we conclude that 

F ((x(a nr ),X(a nr )) e B((x,X),6)) > AT"-, 

for all sufficiently large r. This implies 

]imsuv-logF ({x{a n ),X{a n )) e B((x,X),5)) 
n n> 

> limsup — logP G B((x,X),5)) 

> — log k. 

Applying property ([6j we conclude that Ik{x,X) < \ogk < oo, and the claim is established. 
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Now assume that k and eo > are such that for infinitely many n, E(&) is not a subset of 
(E n (/c)) e °. Thus there is a subsequence of points (x nr ,X nr ) G E(fc) which are at least e away 
from E nr (fc) for each r. Let G [0, £)]( fe + 1 ) xfe be any limit point of (x„ r , X nr ) , r > 1. By 

lower semi-continuity we have that the set E(/c) is closed, and thus compact. Therefore, we also 
have (x, X) G E(/c). Again, without loss of generality, we assume that in fact (x Ur , X Ur ) converges 
to (x,X). Fix any 5 < eo/2. We have (x nr ,X nr ) G -B((x,X),5) for all sufficiently large r. Then 
for all sufficiently large r we have that the distance from (x,X) to E nr (fc) is strictly larger than 
5 (since otherwise the distance from (x nr ,X nr ) to this set is less than e ). This means that for 
all sufficiently large r, 

F((x(a nr ),X{a nr )) G B{(x,X),S)) =0, 

implying 

liminf - logP ((x(a n ),X(a n )) G B((x, X), 5)) 
n n 

< liminf-logP((xK.),XK r )) G B((x,X),5)) 
= — oo. 

Applying property ^ we conclude that Ik{x, X) = oo, contradicting the fact that (x, X) G E(fc). 
This concludes the proof. □ 



6 Discussion 

6.1 Right-convergence for dense vs. sparse graphs 



It is instructive to compare our results from Section |5.4[ in particular the expression for the 
limiting free energy from Theorem [5j to the corresponding results from the theory of convergent 
sequences for dense graphs. 

For sequences G n of dense graphs, i.e., graphs with average degree proportional to the number 
of vertices, the weighted homomorphism numbers hom(G n ,EI) typically either grow or decay 
exponentially in ^(Gn)) 2 , implying that the limit of an expression of the form ^ would be 
either oo or — oo. To avoid this problem, we adopt the usual definition from statistical physics, 
where the free energy is defined by 

f(G n ,M) = _^L_l gZ(G n ,e), (37) 



where Z(G n ,M) is the partition functior^ 



Z(G n ,M) = ] [ Cta(u) ] J 

a:V(G n )^>V(W) ueV(G n ) (u,v)eE(G n ) 



1/|V(G„)| 



with a and A denoting the vector and matrix of the node and edges weights of H, respectively. 



2 Note that the only difference with respect to Q is the exponent 1/|V(<G„)|, which ensures that logZ(G r 
is of order 0(\V n \). 
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It was shown in [BCL + 12] that for a left-convergent sequence of dense graphs, the limit of 
(37) exists, and can be expressed in terms of the limiting graphon. We refer the reader to the 
literature [LS06J for the definition of the limiting graphon; for us, it is only important that it 
is a measurable, symmetric function W : [0, l] 2 — > [0, 1] which can be thought of as the limit of 
a step function representing the adjacency matrices of G n (see Corollary 3.9 of |BCL + 08] for a 
precise version of this statement). 



To express the limit of the free energies (37) in terms of W, we need some notation. Assume 
|V"(H)| = k, and let flj. be the set of measurable functions p = (pi, . . . , pk) '■ [0, 1] —> [0, l] fc such 
that pi is non-negative for all i G [k] and Pi(x) = 1 for all x G [0, 1]. Setting Xi(p) = pi(x)dx 
and X it j(p) = Jq Jq 1 pi(x)pj(y)W(x, y)dxdy } we define the energy and entropy of a "configuration" 
p G Vt k as 

£m{p) = - ^2 Xi{p) log a { ~ - X itj (p) log A id 



and 



respectively. Then 



2 

l<i<k l<i,j<k 



<S k (p) = - dx ^2 pi{x) log pi{x), 

J ° l<*<fc 

lim /(G n ,H)= inf (S u (p) - S k (p) 



Note the striking similarity to the expressions (33), (34) and (35). There is, however, one 
important difference: in the dense case, the limit object W describes an effective edge-density 
and enters into the definition of the energy, while in the sparse case, the limit object is a large- 
deviation rate and enters into the entropy. By contrast, the entropy in the dense case is trivial, 
and does not depend on the sequence G n , similar to the energy in the sparse case, which does 
not depend on G n . 

As we will see in the next subsection, this points to the fact that right-convergence is a much 
richer concept for sparse graphs than for dense graphs. Indeed, for many interesting sequences 
of dense graphs, one can explicitly construct the limiting graphon W, reducing free energy 
calculations to simple variational problems. By contrast, we do not even know how to calculate 
the rate functions for the simplest random sparse graph, the Erdos-Renyi random graph. We 
discuss this, and how it is related to spin glasses, in the next section. 



6.2 Conjectures and open questions 

While Theorem [4] establishes a complete picture regarding the relationships between LD, left, 
right and partition-convergence, several questions remain open regarding colored-neighborhood- 
convergence. In particular, not only do we not know whether LD implies colored-neighborhood- 
convergence or vice verse, but we not even know whether colored-neighborhood-convergence 
implies right-convergence. 

Since colored-neighborhood-convergence does not appear to take into account the counting 
measures involved in the definition of right-convergence, we conjecture that in fact colored- 
neighborhood-convergence does not imply right-convergence. Should this be the case it would also 
imply that colored-neighborhood-convergence does not imply LD-convergence, via Theorem |4j 
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At the same time we conjecture that the LD-convergence implies the colored-neighborhood- 
convergence. It is an interesting open problem to resolve these conjectures. 

Perhaps the most important outstanding conjecture regarding the theory of convergent sparse 
graph sequences concerns convergence of random sparse graphs. Consider two natural and much 
studied random graphs sequences, the sequence of Erdos-Renyi random graphs G(n, c/n), and the 
sequence of random regular graphs G(n, A). As usual, G(n, c/n) is the random subgraph of the 
complete graph on n nodes where each edge is kept independently with probability c/n, where c 
is an n-independent constant, and G(n, A) is a A-regular graph on n nodes, chosen uniformly at 
random from the family of all n-node A-regular graphs. We ask whether these graph sequences 
are convergent with respect to any of the definitions of graph convergence discussed above^J 

It is well known and easy to show that these graphs sequences are left- convergent, since the 
r-depth neighborhoods of a random uniformly chosen point can be described in the limit as 
n — > oo as the first r generations of an appropriate branching process. The validity for other 
notions of convergence, however, presents a serious challenge as here we touch on the rich subject 
of mathematical aspects of the spin glass theory. The reason is that right-convergence of sparse 
graph sequences is nothing but convergence of normalized partition functions of the associated 
Gibbs measures. Such convergence means the existence of the associated thermodynamic limit 
and this existence question is one of the outstanding open areas in the theory of spin glasses. 

Substantial progress has been achieved recently due to the powerful interpolation method 
introduced by Guerra and Toninelli |GT02j in the context of the Sherrington-Kirkpatrick model, 
and further by Franz and Leone |FL03| . Panchenko and Talagrand |PT04] . Bayati, Gamarnik 
and Tetali |BGT 10j. Abbe and Montanari |AM10j and Gamarnik [Gam] . In particular, a broad 



class of weighted graphs EI is identified, with respect to which the random graph sequence 
G(n,c/n) is right-convergent w.h.p. (in some appropriate sense). Some extensions of these 
results are also known for random regular graph sequences G{n, A). A large general class of 
such weighted graphs EI is described in |Gamj in terms of suitable positive semi-definiteness 
properties of the graph EI. Another class of graphs EI with respect to which convergence is 
known and in fact the limit can be computed is when EI corresponds to the ferromagnetic Ising 
model and some extensions [DM10j,[DMSllJ. We note that for the latter models, it suffices 
that the graph sequence is left-convergent to some locally tree-like graphs satisfying some mild 
additional properties. 

Still, the question whether the two most natural random graph sequences G(n,c/n) and 
G(n, A) are right-convergent (with respect to all EI) is still open. The same applies to all other 
notions of convergence discussed in this paper (except for left-convergence). We conjecture that 
the answer is yes for all of them. Thanks to Theorem [4] this would follow from the following 
conjecture. 

Conjecture 1. The random graph sequences G(n,c/n) and G(n,A) are almost surely LD- 
convergent. 

The quantifier "almost sure" needs some elaboration. Here we consider the product rin^> 
where Q n is the set of all simple graphs on n nodes. We equip this product space with the natural 
product probability measure, arising from having each Q n induced with random graph G(n, c/n) 

3 In contrast to G(n, A), the graphs G(n, c/n) do not have uniformly bounded degrees, which means that most 
of the theorems proved in this paper do not hold. Nevertheless, we can still pose the question of convergence of 
these graphs with respect to various notions. 
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measure. Our conjecture states that, almost surely with respect to this product probability space 
Q, the graph sequence realization is LD- convergent. 
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